Kurdish Kurmanji Lemmatization and Spell-checker with Spell-correction
Abstract
Many studies address lemmatization and spell-checking with spell-correction for English, Arabic, and Persian, but only a few exist for low-resource languages such as Kurdish, and more specifically for the Kurmanji dialect, which has increased the need for such systems. Lemmatization is the process of determining the base or dictionary form (lemma) of a specific surface pattern, whereas spell-checkers and spell-correctors, respectively, determine whether a word is correctly spelled and correct a range of spelling errors. This research presents a lemmatization and word-level error correction system for the Kurdish Kurmanji dialect, which are, to our knowledge, the first such tools for this dialect. The proposed lemmatization approach is built on morphological rules, and a hybrid approach that relies on the n-gram model and the Jaccard Coefficient Similarity algorithm was applied to spell-correction. The lemmatization results, detailed in the article, show accuracy rates of 97.7% and 99.3% for nouns and verbs, respectively. Furthermore, for spell-checking and spell-correction, accuracy rates of 100% and 90.77%, respectively, were attained.
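The abstract names an n-gram model and the Jaccard coefficient but does not say how the two are combined. As a rough illustration only, the following Python sketch ranks lexicon entries by the Jaccard similarity of their character bigram sets. The function names, the `#` boundary padding, the bigram size, and the five-word Kurmanji lexicon are all assumptions made for this example and are not taken from the paper.

```python
# Illustrative sketch, not the authors' implementation.
# Assumptions: character bigrams, '#' padding, a tiny hand-picked lexicon.

LEXICON = {"xwendin", "xwarin", "hatin", "kirin", "gotin"}


def char_ngrams(word, n=2):
    """Return the set of character n-grams of a word padded with '#'."""
    padded = f"#{word}#"
    return {padded[i:i + n] for i in range(len(padded) - n + 1)}


def jaccard(a, b):
    """Jaccard coefficient |A & B| / |A | B| of two sets."""
    if not a and not b:
        return 1.0
    return len(a & b) / len(a | b)


def suggest(word, lexicon=LEXICON, n=2, top_k=3):
    """Return the word itself if it is in the lexicon (spell-check step),
    otherwise up to top_k lexicon entries ranked by Jaccard similarity
    (spell-correction step)."""
    if word in lexicon:
        return [word]
    target = char_ngrams(word, n)
    scored = [(jaccard(target, char_ngrams(cand, n)), cand) for cand in lexicon]
    # Highest score first; ties broken alphabetically for determinism.
    scored.sort(key=lambda pair: (-pair[0], pair[1]))
    return [cand for score, cand in scored[:top_k] if score > 0]


if __name__ == "__main__":
    print(suggest("xwendn"))  # misspelling of "xwendin"
```

In this sketch the lexicon lookup plays the role of the spell-checker and the ranking plays the role of the spell-corrector. A real system would need a full Kurmanji word list and probably a similarity threshold.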
Similar resources
Khmer Spell Checker
Khmer is the official language of Cambodia. It is a complex language. Similar to Chinese, Japanese and Thai, Khmer words are written without spaces or other word delimiters. This is a major challenge in spell checking Khmer since there is no simple way to determine word boundaries. However, it is feasible to spell check Khmer. The process of spell checking Khmer is different from the spell chec...
A Novel Binary Spell Checker
In this paper we propose a simple, flexible and efficient hybrid spell checking methodology based upon phonetic matching, supervised learning and associative matching in the AURA neural system. We evaluate our approach against several benchmark spell-checking algorithms for recall accuracy. Our proposed hybrid methodology has the joint highest top 10 recall rate of the techniques evaluated. The...
A Spell Checker for a World Language: The New Microsoft's Spanish Spell Checker
This paper reports work carried out to develop a speller for Spanish at Microsoft Corporation, discusses the technique for isolated-word error correction used by the speller, provides general descriptions of the error data collection and error typology, and surveys a variety of linguistic considerations relevant when dealing with a world language spread over several countries and exposed to diff...
Design and Implementation of Punjabi Spell Checker
Spellcheckers are the basic tools needed for word processing and document preparation. Designing a spell checker for Indian languages such as Punjabi poses many new challenges not found in English, which complicates the design of the spell checker. Punjabi language is far different from Western languages in phonetic properties and grammatical rules. Thus the existing algorithms and techniques t...
An extended spell checker for unknown words
Spell checking is considered a solved problem, but with the rapid development of the natural language processing the new results are slowly extending the means of spell checking towards grammar checking. In this article I review some of the spell checking error classes in a broader sense, the related problems, their state-of-the-art solutions and their different nature on different types of lan...
Journal
Journal title: UHD journal of science and technology
Year: 2023
ISSN: 2521-4209, 2521-4217
DOI: https://doi.org/10.21928/uhdjst.v7n1y2023.pp43-52